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For a macroscopic, isolated quantum system in an unknown pure state, the expectation value 
of any given observable is shown to hardly deviate from the ensemble average with extremely high 
probability under generic equilibrium and non-equilibrium conditions. Special care is devoted to the 
uncontrollable microscopic details of the system state. For a subsystem weakly coupled to a large 
heat bath, the canonical ensemble is recovered under much more general and realistic assumptions 
than those implicit in the usual microcanonical description of the composite system at equilibrium. 
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Basic questions of statistical physics have gained re- 
newed interest with the discovery of various work and 
fluctuation theorems A further topic which has at- 
tracted much attention concerns the foundation of the 
canonical formalism 0, i, i, H. One of its key ingre- 
dients consist in shifting the focus from the traditional 
statistical equilibrium ensembles back to the role and pre- 
dictability of one single experimental realization of a sys- 
tem (and its environment), described theoretically by a 
quantum mechanical pure state. In essence, the main 
message of the seminal works [l, Q , for which the name 
"canonical typicality" has been coined in 0], is as fol- 
lows. Consider the usual canonical setup, i.e. an isolated 
"super-system", compound of a "large" thermal bath B, 
a comparatively "small" subsystem of actual interest S, 
and a negligibly weak coupling between them. Those 
energy eigenstates of the compound S-f-B with energy- 
eigenvalues in the interval [E — AS, E] span a Hilbert 
space, from which we pick at random a pure state \'ip). 
The corresponding projector gives rise to a mixed 

state PS of the subsystem S by tracing out the bath de- 
grees of freedom. Now, the remarkable finding of Refs. 
[2, 3 is that PS will be extremely close to the standard 
canonical density operator pcan of the subsystem S for the 
overwhelming majority of random pure states hence 
the name "canonical typicality" . In other words, what- 
ever is the (unknown) pure state of the compound S-l-B, 
the outcome of any experiment on the subsystem S is 
practically the same as if it were in the canonically mixed 
state Pcmi- For a more detailed, precise, and also more 
general exposition we refer to the original Refs. [3, 
Further, it should be pointed out that essentially the 
same conclusion could be inferred from formula (C.17) 
of the formidable prior work For less general sys- 
tem classes and/or after performing an additional time 
average, closely related results have been obtained even 
earlier in Refs. 

Here, we will show that a quite similar "typicality" 
property already holds for any isolated system, even 
when it cannot be decomposed into subsystem S and bath 
B. In the special case that such a decomposition is pos- 
sible, the original "canonical typicality" is recovered by 
"tracing out the bath" , thereby shedding also new light 
on the role of entanglement. A further main point is to 



abandon the quite unrealistic assumption of the previ- 
ous works [1, [sj that all energy eigenstates belonging to 
the preset energy interval \E — AS, E] contribute to the 
pure state with equal probabilities and all the other 
energy eigenstates are excluded. Rather, in reality one 
only knows the occupation probabilities of the energy lev- 
els very roughly and hence the unknown details should 
not matter in the final results. This problem (and our 
solution) is clearly not restricted to the issue of typical- 
ity but concern the standard microcanonical formalism 
in general. 

Setup: We consider a quantum mechanical system, 
whose Hilbert space TL is spanned by the orthonormal 
basis N < oo. Hence, any pure state |-0) is of 

the form 

where c„ G C, c := (ci, cat), ||c|| := y^J2 l^nP) and 
the sum runs from n = 1 to A^. The division by ||c|| will 
be particularly convenient for our purposes. 

As in Refs. dUi], we assume that the system is 
in some pure state \4>) € Ti. but we do not know which 
one. In other words, the c„ in ([T]) are randomly drawn 
from some probability density p(c). Denoting the corre- 
sponding ensemble average of any function g(c) by 

r N 

g{c) := / g{c)p{c) [| d(Rec„) d(Imc„) , (2) 

n=l 

the expectation value of an arbitrary observable A = A'' : 
H ^ H takes the form 

(A) IMm^i^pA) (3) 
p := MM = 5](c„c*„/||c|P) \n){m\ . (4) 

For infinite dimensional systems, well defined limits N — > 
oo in ([TJ31) and later on are tacitly taken for granted. 

Next we turn to our two key assumptions regarding 
p(c): (i) The Cn in ([T]) are statistically independent and, 
moreover, c„ and e*'''"c„ are equally likely for arbitrary 
phases tpn, or equivalently, p(c) is of the form 

N 

P{c)=Y[pn{\Cn\) ■ (5) 
n=l 
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As a consequence, the density operator (|4]) takes the form 

p = ^p„|n)H , p„:=|c„|7l|c|P (6) 

with p„ > and ^ p„ = 1. (ii) The mixed state p has a 
low purity, i.e. 

trp2=^p2^<l, (7) 

or equivalently, max/9„ <C 1, or equivalently, there are 
not just a few dominating p„ (summing up to almost 
unity). Before justifying these two assumptions, we show 
what can be concluded from them. 

Typicality: Our first objective is to show that 
is typically very close to the average ([3]), i.e. 

4 := mA\^) - {A)? (8) 

is small in an appropriate sense. Postponing the formal 
proof to a later paper, we adopt here a more heuristic 
line of reasoning, similarly in spirit to Ref. 0]. We first 
consider the deviation of s(c) :— \\c\\^ from its average 

^ = = Slc^l- Eq. ([5]) implies that the |c„p are 

independent random variables and Eq. ([7]) that a large 
number of them significantly contributes to the sum s. 
Taking for granted finite variances 

9n {\cl\-W\?/W^ =mW^^l, (9) 

the law of large numbers implies that s(c) — s is an un- 
biased random variable with a very small standard de- 
viation compared to s. Hence, for our present purpose 
of estimating ([8]) we can replace ||c|| in ([T]) in very good 
approximation by s^/"^ . As a consequence, one finds from 
that |c^J = PnS. Similarly, introducing ([1]) with 
||c|| ~ s^/^ into dS]) and exploiting ^ yields 

2 X ^ "j 2 2 \ ^nm) \ Amn \ ~^ AjuiAjyi^ 

n,m 

A A-{A) , Anm {n\A\m) . (10) 

Observing that |c^c^| = PnPmS^ for n ^ m according 
to ([5]) and jc^ | = p„ s, that |c^| = {qn + 1) pl^s^ according 
to ([9]), and that^p„p,„ A„„Am„ = (X^Pn^nn)^ = 
according to (|3l6ll0p . yields 

(^A = ^PnPmAnmA,nn+^{qn~l)plAl^ -(H) 

Exploiting again (|3l6ll0p . the first sum can be identi- 
fied with tr(pA)^. Performing this trace with the help 
of the eigenvectors and eigenvalues Ui, of A yields 
X Oi/fl^ From pH)) it follows that dmin ■— 

mina,y < 0, amax ■— maxa^ > 0, and hence Aa :~ 
dmax — dmin > \d,u\ for any ly. As a consequence, the 
first sum in ([TT|) is bounded by Z\| trp^. Similarly, in the 
second sum we exploit that < A^, yielding 

a2 < Al(max(7„)(trp2) . (12) 



In the special case that p is of the standard microcanon- 
ical form (see below), the same result also follows from 
Eq. (C.17) in [|. 

Chebyshev's inequality implies for any given e > 
that aj I is an upper bound for the probability that 
— (^)l exceeds e. Exploiting one finally 

infers for K observables {A/cj^Lj^ and any e > that 

Prob fmax|(^|Afc|?/>) - (Afe)|/AA, > e) 

\k<K J 

< is:(maxq„) (trp2)/e . (13) 

n 

This is the first main result of our paper. With (fTO|) 
one sees that /S.a equals the difference between the max- 
imal and minimal eigenvalues A and hence quantifies 
the full range of all a priori possible values of {■4>\A\il}). 
In (fTS]) we tacitly excluded trivial observables Ak with 
Ayij. = 0. The qn in ^ and hence maxg^ are dimen- 
sionless, non-negative numbers, typically of the order of 
unity. E.g. any Gaussian factor p„ in (O yields (j„ = 1. 
Hence, (jl3p with ([7]) imply typicality: a randomly sam- 
pled pure state \ip) G H is very likely to yield expecta- 
tion values {ip\Ak\il') very close to the ensemble averages 
{Ak) = tr(pAk) simultaneously for a quite large number 
K of arbitrary but fixed observables Ai, ...,Ak- 

Generalized microcanonical formalism: So far, typical- 
ity (jl3p applies to any quantum mechanical system sat- 
isfying ([5]) and ([7]). Next we specifically justify these 
assumptions (O, ([7]) for an isolated system at ther- 
mal equilibrium with / = 0(10^'^) degrees of freedom 
and with \n) being the eigenvalues of the Hamiltonian 
H ^Y.En\n){n\, En>En-i, Ei> 

The assumption that coefficients c„ and e''^"'Cn occur 
with equal probability in ([1]) is quite suggestive. Indeed, 
upon time evolution, the eigenvectors |n) acquire factors 
of the form e"'^"*/^. Taking for granted that p(c) does 
not change with time at thermal equilibrium, the invari- 
ance under c„ <—>■ e^^'^Cn follows under rather mild and 
generic incommensurability conditions for the i?„. Ex- 
ploiting this property, one readily concludes that c„ = 
for all n and that c„Cm = for all n 7^ to. In other words, 
the c„ are uncorrelated. This does not yet imply inde- 
pendence in principle, but in practice it almost always 
does, and hence assumption ([5]) is reasonable. 

The starting point of the seminal works 0, Q is the 
assumption that in ([1]) all coefficients Cn correspond- 
ing to energies En within some preset energy interval 
\E — AE, E] are "equally likely" , while all other c„ are 
zero. Denoting by c' the vector of all non-zero c„ and by 
c" those which must be zero, this means that p{c) can be 
so chosen that all c' of equal length ||c'|| must be equally 
probable and thus p{c) must be of the form g(| |c'| |)5(c") 
for some (properly normalized, non-negative) function 
g. Further, the division by ||c|| in (jT]) impfies that any 
such g actually yields the same distribution of vectors 
lip). Choosing a Gaussian g^ the c„ can thus without 
loss of generality be considered [2] as independent, Gaus- 
sian, and satisfying ([5]). Moreover, p from ^ becomes 
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the standard microcanonical density operator pmic with 
equal weights p„ > if i?„ G [i? - AS, E] and Pn — 
otherwise 0- 

On one hand, our present approach thus includes the 
standard microcanonical formalism [7,] and the starting 
point of Rcfs. as special cases. On the other hand, 

the above observation that an entire class of different p(c) 
actually yields - due to the division by ||c|| in ([T]) - the 
same distribution of vectors \ip) still remains true, and 
hence the assumption that one of those equivalent p{c) 
satisfies ([5]) is very weak indeed. On top of that, our 
above proof of typicality can even be significantly gen- 
eralized beyond the independence assumption ([5|) itself. 
In particular, the Gaussian adjusted projected measures 
(GAP) from "d] are still admissible. 

Before turning to condition ([7]), we recall some stan- 
dard notions and general properties regarding the energy 
spectrum {E„} 7]. Denoting the number of states within 
[E — AE, E] by Q{E) and Boltzmann's constant hy ks, 
entropy and temperature follow as 

S{E) kg \nn{E) , T{E) 1/S'{E) . (14) 

One finds 01 that Q{E) is a very rapidly increasing 
function of E with typical values in range of lO'-^^-''^ 
/ = 0(10^'^). Hence, fl{E) is largely independent of 
AE (provided AE ^ ksT), and its derivative can be 
identified with the density of states, 

!]'(£;) = ^ ,5(i;„ - i;) , (15) 

where the delta functions arc slightly washed out to yield 
smooth functions in 

In view of this tremendous density of energy levels 
En, it is indeed quite convincing that no real system 
can be prepared such that just a few of them are popu- 
lated with appreciable probability pn, implying that ([7]) 
is indeed satisfied. In fact, an even stronger statement 
is quite plausible and will be derived dynamically else- 
where, namely that these populations p„ can be written 
in the form 

Pn = h{En) (16) 

with a smooth function h, exhibiting appreciable varia- 
tions only on scales much larger that En — En-i- 

Due to pBj) . the energy distribution p{E) :— {6{H — 
E)) can be rewritten with (|3l6p as J2PnS{En — E) — 
h{E) S{En — E) and, after washing out the delta func- 
tions according to (fT^l) , as 

p{E) ^h{E)n'{E) . (17) 

In reality, after the experimentalist has prepared the sys- 
tem as carefully as possible, the only available knowledge 
about h and hence p is that the probability density pT|) 
exhibits a relatively sharp peak (but still very wide com- 
pared to En — En-i). All further details of p{E) are 
completely fixed by the given experimental setup, but it 



is impossible to know them. The only way out is to verify 
that these details "do not matter" . Experimentally, this 
seems indeed to be the case, but theoretically it has ap- 
parently not been demonstrated so far. On the contrary, 
for the usually considered pmic, the concomitant details 
of p{E) are in fact quite unrealistic. 

As a first example, we show that the celebrated relation 

-kBtriplnp) ^ S{E*) (18) 

indeed holds for any sharply peaked p{E) with E* located 
in the peak region: Exploiting ^ and (|16p yields 

tr(plnp) ="^h{En)\nh{En) = 

= J dEh{E)\nh{E) ^5{En-E) . 

Due to (|15p and p?)) we can conclude that 

ii{p\np) = j dEp{E)\nh{E) . (19) 

The integrand is dominated by the sharp peak of p{E) 
since the possibly comparable variations of h{E) (cf. 
([T7|) are tamed by the logarithm. Hence, there exists 
an energy E* within the peak region with the property 

j dE p{E)\nh{E) :^\nh{E*) J dE p{E) . 

Likewise, Eq. (jl7p and the normalization of p{E) imply 

1 = j dEp{E) = p{E*) e = h{E*) n'{E*) e , (20) 

where e essentially represents the peak width of p{E). 
Eqs. (Ull) imply Q' = VL/keT, yielding with (fT9|- (pO| 

tr(plnp) = ~\nn{E*) -\n{e/kBT{E*)) . (21) 

Since InQ = 0(10^^) (see below (Hi])), the last term in 
(PT|) is negligible for any realistic e and with (|14p our 
proof of pS]) is completed. A somewhat similar calcula- 
tion has been performed in chapter 12.3.2 of [1] but its 
purpose and physical content is quite different. 

An analogous line of reasoning yields trp^ w 
l/il{E*) = 10-"^^°'''\ i.e. typicality 1^ is extremely 
well satisfied for a very large number K of observables. 

Canonical formalism: As in the introduction, we con- 
sider a subsystem S in weak contact with a much larger 
bath B, resulting in a compound S-l-B with a product 
Hilbert space H = Hs ® Ti-B and a Haniiltonian H — 
Hs ®Ib + '^s® Hb, where Is is the identity on Hs and 
similarly for 1b- Given Hs = J2Ej \j)ss{j\ and Hb = 
\k)BB{k\ we thus have H = J2jk Ejk\jk){jk\ with 
Ejk := Ej + E^ and \jk) :— \j}s\k}B, i-e. previous la- 
bels n now become jk. Since only subsystem observables 
A = As <8 1b are of interest, ^ can be rewritten as 

(A) = trsipcariAs) , Pcan trs(p) , (22) 
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with trs the partial trace over Hs, and similarly for tr^. 
Likewise, {ip\A\ip) — tj:{\ip){ip\ A) can be rewritten as 

{^\A\ij) = trsipsAs) , PS trs(|V')(V'|) ■ (23) 

Eq. (|13p implies "canonical typicality" in the sense that 
for the vast majority of pure states \ip) of the compound 
S+B, the corresponding mixed state of the subsystem ps 
yields practically the same result for K subsystem ob- 
servables As^i, ...jAs.k as the ensemble averaged mixed 
state Pcan- If the bath B is sufficiently much larger than 
the subsystem S then the extremely low purity of the 
compound S+B implies typicality even for all possible 
subsystem observables As, giving rise to a natural met- 
ric [3| according to which the reduced density operator 
PS itself is close to 

t'can • 

Finally, one finds that any p{E) in pT|) with a sharp 
peak near E* results in the same canonical density matrix 

p,an = CX_J,{-Hs/kBT{E*)} . (24) 

The main line of reasoning to prove is analogous 
to p^ - (PT|) . while the somewhat more tedious details 
will be presented elsewhere. In particular, implies 
that the expectation value ((22)) of arbitrary subsystem 
observables As are indeed independent of any further 
details of p{E). 

Conclusions: We have shown that the overwhelming 
majority of pure states yields practically identical expec- 
tation values for any given (not extremely large) set of ob- 
servables under conditions which are generically satisfied 
for isolated macroscopic systems at thermal equilibrium. 
A second main result is that the experimentally uncon- 
trollable and hence unknown microscopic details of the 
system state are indeed irrelevant. In particular, for the 
practically most important system-plus-bath setup, the 
canonical ensemble (|24p is recovered under much more 
general and realistic assumptions than those implicit in 
the usual microcanonical description of the composite 
system. 



The finding that (|18|) does not depend on the unknown 
details of the equilibrium ensemble p also sheds new light 
on the usual information theoretical "derivation" of the 
microcanonical ensemble [t*]: While pmic indeed mini- 
mizes the information functional trp In p, the information 
content of many other p's is almost equally low and one 
cannot conclude that practically only the exact minimum 
Pmic occurs in reality. 

The present approach generalizes the seminal works 
[IjQ on canonical typicality in two respects: The system 
needs not be a subsystem-plus-bath compound, and the 
equilibrium ensemble need not be of the quite particular 
microcanonical form. 

Given a compound S+B in a pure state |'0), a well 
know consequence of entanglement between subsystem 
S and bath B is a mixed state ps after tracing out the 
bath according to While entanglement has been 

proposed as main origin of canonical typicality in Refs. 
3] , our present findings suggest that the main root is the 
typicality property of the entire compound, which is in 
turn not entangled with any further system. 

We close with a simple but quite interesting observa- 
tion regarding systems out of equilibrium: Specifically, 
assume that the system is isolated and has reached equi- 
librium for times t < 0, while for t > an external 
perturbation sets in [Ij, giving rise to an explicitly time 
dependent Hamiltonian H(t) and a corresponding prop- 
agator U{t2,ti). Instead of propagating the states j^/;) 
beyond t = 0, we can switch to the Heisenberg picture 
and instead propagate the observables. In this way, by 
replacing for any given t > the original observable A by 
U^{t, 0)AU{t, 0), all the equilibrium typicality properties 
at t = immediately carry over to the out of equilibrium 
situation for t > 0. For not explicitly time dependent A, 
the spectrum remains invariant under time propagation, 
and hence (jl3p remains valid for any given t > with p 
being the equilibrium density operator at t = 0. 
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